Scene Text Detection Using Attention with Depthwise Separable Convolutions
Abstract
In spite of significant research efforts, the existing scene text detection methods fall short of the challenges and requirements posed in real-life applications. In natural scenes, text segments exhibit a wide range of shape complexities, scale, and font property variations, and they appear mostly incidental. Furthermore, the computational requirement of the detector is an important factor for real-time operation. To address the aforementioned issues, the paper presents a novel scene text detector using a deep convolutional network which efficiently detects arbitrary oriented and complex-shaped text segments from natural scenes and predicts quadrilateral bounding boxes around text segments. The proposed network is designed in a U-shape architecture with the careful incorporation of skip connections to capture complex text attributes at multiple scales. For addressing the computational requirement of the input processing, the proposed scene text detector uses the MobileNet model as the backbone that is designed on depthwise separable convolutions. The network design is integrated with text attention blocks to enhance the learning ability of our detector, where the attention blocks are based on efficient channel attention. The network is trained in a multi-objective formulation supported by a text-aware non-maximal procedure to generate final text bounding box predictions. On extensive evaluations on ICDAR2013, ICDAR2015, MSRA-TD500, and COCOText datasets, the paper reports F-scores of 0.910, 0.879, 0.830, and 0.617, respectively.
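The abstract attributes the detector's low computational cost to a backbone built on depthwise separable convolutions. The sketch below is not taken from the paper; it only illustrates the standard cost comparison between a regular convolution and its depthwise separable counterpart. The function names, the layer sizes in the usage note, and the simplifications (no bias terms, stride 1, output resolution equal to input resolution) are assumptions made for illustration.

```python
def standard_conv_cost(k, c_in, c_out, h, w):
    """Parameters and multiply-adds of one k x k standard convolution.

    Assumes no bias, stride 1, and an h x w output feature map.
    """
    params = k * k * c_in * c_out
    mult_adds = params * h * w  # every weight is used once per output position
    return params, mult_adds


def depthwise_separable_cost(k, c_in, c_out, h, w):
    """Parameters and multiply-adds of a depthwise separable convolution.

    Depthwise step: one k x k filter per input channel.
    Pointwise step: a 1 x 1 convolution that mixes channels.
    Same simplifying assumptions as standard_conv_cost.
    """
    depthwise_params = k * k * c_in
    pointwise_params = c_in * c_out
    params = depthwise_params + pointwise_params
    mult_adds = params * h * w
    return params, mult_adds


def reduction_ratio(k, c_in, c_out, h, w):
    """Separable cost divided by standard cost; equals 1/c_out + 1/k**2."""
    _, standard = standard_conv_cost(k, c_in, c_out, h, w)
    _, separable = depthwise_separable_cost(k, c_in, c_out, h, w)
    return separable / standard
```

For a hypothetical layer with a 3 x 3 kernel, 32 input channels, and 64 output channels, `standard_conv_cost(3, 32, 64, 56, 56)` gives 18432 parameters while `depthwise_separable_cost(3, 32, 64, 56, 56)` gives 2336, a ratio of about 0.127. The actual savings in the paper's detector depend on its full architecture, which the abstract does not specify.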
Similar resources
Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions
Depthwise convolutions provide significant performance benefits owing to the reduction in both parameters and mult-adds. However, training depthwise convolution layers with GPUs is slow in current deep learning frameworks because their implementations cannot fully utilize the GPU capacity. To address this problem, in this paper we present an efficient method (called diagonalwise refactorization...
Natural scene text localization using edge color signature
Localizing text regions in images taken from natural scenes is one of the challenging problems due to variations in font, size, color and orientation of text. In this paper, we introduce a new concept so-called Edge Color Signature for localizing text regions in an image. This method is able to localize both Farsi and English texts. In the proposed method first a pyramid using diff...
Object Attention Patches for Text Detection and Recognition in Scene Images using SIFT
Natural urban scene images contain many problems for character recognition such as luminance noise, varying font styles or cluttered backgrounds. Detecting and recognizing text in a natural scene is a difficult problem. Several techniques have been proposed to overcome these problems. These are, however, usually based on a bottom-up scheme, which provides a lot of false positives, false negativ...
Reading Scene Text with Attention Convolutional Sequence Modeling
Reading text in the wild is a challenging task in the field of computer vision. Existing approaches mainly adopted Connectionist Temporal Classification (CTC) or Attention models based on Recurrent Neural Network (RNN), which is computationally expensive and hard to train. In this paper, we present an end-to-end Attention Convolutional Network for scene text recognition. Firstly, instead of RNN...
Text Detection in Natural Scene Images using Spatial Histograms
In this paper, we present a texture-based text detection scheme for detecting text in natural scene images. This is a preliminary work towards a complete system of text detection, localization and recognition in order to help visually impaired persons. We have employed spatial histograms computed from gray-level co-occurrence matrices for texture coding and three classifiers have been evaluated...
Journal
Journal title: Applied Sciences
Year: 2022
ISSN: 2076-3417
DOI: https://doi.org/10.3390/app12136425